-
Notifications
You must be signed in to change notification settings - Fork 5.8k
This commit contains all significant contributions until 23/8/2016 #761
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: 4.x
Are you sure you want to change the base?
Conversation
Implemented ImagePreprocessor a geneology of functors for preprocessing samples before they are fed to a CNN. Implemented TextImageClassifer as an abstraction of any object that provides a vector of probabillities given an image. A gluing class providing compatibillity to callbacks for OCRBeamsearchDecoder allows to plug any such object to the existing algorithms. Implemented DeepCNN a class that is supposed to load pretrained caffe models. Although several backends have been planed the only fully ope- rational one is the one employing caffe its self. GPU is supported. Implemented a class using a DeepCNN that implements word-spotting Implemented a python and a C++ demo demonstraiting the WorSpotting Class Implemented a class that generates sythetic text samples at high speeds and with a full pipeline of realistic distortions for training Word-spotting CNN's who need millios of samples. Implemented a python demo demonstraiting the usage of the text synthesizer
…y to the synthesizer with the simple highgui
…keLists.txt substituting configure_file with add_definitions.
… empty file as was originally in the master branch
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have several more comments:
DeepCNN_classdiagram.pdf
can be removed, because doxygen can generate inheritance diagrams- please resolve merge conflicts
- can we close #723 ?
@@ -1,7 +1,13 @@ | |||
#ifndef __OPENCV_TEXT_CONFIG_HPP__ | |||
#define __OPENCV_TEXT_CONFIG_HPP__ | |||
|
|||
// HAVE QT5 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actually this file is not usable, because there is no corresponding configure_file command in cmake script (like here).
HAVE_TESSERACT
works because the definition is being added to the compilation command (with add_definitions(-DHAVE_TESSERACT)
).
Actually I think we should use the dnn module to run the network, is it possible?
@@ -61,82 +65,126 @@ enum | |||
OCR_LEVEL_TEXTLINE | |||
}; | |||
|
|||
//base class BaseOCR declares a common API that would be used in a typical text recognition scenario | |||
//base class BaseOCR declares a common API that would be used in a typical text |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please, restore the formatting of all documentation comments and code.
@@ -43,6 +43,7 @@ | |||
#include "precomp.hpp" | |||
#include "opencv2/imgproc.hpp" | |||
#include "opencv2/ml.hpp" | |||
#include "opencv2/highgui.hpp" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it needed?
message(STATUS "TEXT CAFFE CONFLICT") | ||
endif() | ||
|
||
find_package(Qt5Gui) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It will be better to use HAVE_QT5
from the main repository. (examples)
Ping @anguelos . |
This pullrequest changes
Implemented ImagePreprocessor a genealogy of functors for preprocessing
samples before they are fed to a CNN.
Implemented TextImageClassifer as an abstraction of any object that
provides a vector of probabilities given an image. A gluing class
providing compatibility to callbacks for OCRBeamsearchDecoder allows
to plug any such object to the existing algorithms.
Implemented DeepCNN a class that is supposed to load pre-trained Caffe
models. Although several back-ends have been planed the only operational
one is the one employing Caffe its self. GPU is fully supported but can only
be enabled and disabled globally.
Implemented a class using a DeepCNN that implements word-spotting
Implemented a python and a C++ demo demonstrating the Word-spotting
Class
Implemented a class that generates synthetic text samples at high speeds
and with a full pipeline of realistic distortions for training
Word-spotting CNN's who need millions of samples.
Implemented a python demo demonstrating the usage of the text synthesizer